Dataset statistics
| Number of variables | 40 |
|---|---|
| Number of observations | 380932 |
| Missing cells | 250345 |
| Missing cells (%) | 1.6% |
| Duplicate rows | 11 |
| Duplicate rows (%) | < 0.1% |
| Total size in memory | 810.5 MiB |
| Average record size in memory | 2.2 KiB |
Variable types
| Text | 1 |
|---|---|
| Categorical | 10 |
| Numeric | 6 |
| Boolean | 23 |
| Dataset has 11 (< 0.1%) duplicate rows | Duplicates |
HeightInMeters is highly overall correlated with WeightInKilograms and 1 other fields | High correlation |
WeightInKilograms is highly overall correlated with HeightInMeters and 1 other fields | High correlation |
BMI is highly overall correlated with WeightInKilograms | High correlation |
Sex is highly overall correlated with HeightInMeters | High correlation |
AgeCategory is highly overall correlated with PneumoVaxEver | High correlation |
PneumoVaxEver is highly overall correlated with AgeCategory | High correlation |
HadHeartAttack is highly imbalanced (68.0%) | Imbalance |
HadAngina is highly imbalanced (66.5%) | Imbalance |
HadStroke is highly imbalanced (73.9%) | Imbalance |
HadSkinCancer is highly imbalanced (59.0%) | Imbalance |
HadCOPD is highly imbalanced (59.0%) | Imbalance |
HadKidneyDisease is highly imbalanced (72.6%) | Imbalance |
HadDiabetes is highly imbalanced (59.4%) | Imbalance |
DeafOrHardOfHearing is highly imbalanced (55.4%) | Imbalance |
BlindOrVisionDifficulty is highly imbalanced (68.7%) | Imbalance |
DifficultyDressingBathing is highly imbalanced (75.6%) | Imbalance |
DifficultyErrands is highly imbalanced (60.3%) | Imbalance |
HighRiskLastYear is highly imbalanced (74.2%) | Imbalance |
PhysicalHealthDays has 8988 (2.4%) missing values | Missing |
MentalHealthDays has 7420 (1.9%) missing values | Missing |
LastCheckupTime has 6793 (1.8%) missing values | Missing |
SleepHours has 4376 (1.1%) missing values | Missing |
RemovedTeeth has 9230 (2.4%) missing values | Missing |
ChestScan has 16179 (4.2%) missing values | Missing |
RaceEthnicityCategory has 10780 (2.8%) missing values | Missing |
AgeCategory has 6348 (1.7%) missing values | Missing |
HeightInMeters has 8631 (2.3%) missing values | Missing |
WeightInKilograms has 20361 (5.3%) missing values | Missing |
BMI has 25608 (6.7%) missing values | Missing |
AlcoholDrinkers has 4901 (1.3%) missing values | Missing |
HIVTesting has 18321 (4.8%) missing values | Missing |
PneumoVaxEver has 29798 (7.8%) missing values | Missing |
TetanusLast10Tdap has 34687 (9.1%) missing values | Missing |
PhysicalHealthDays has 229002 (60.1%) zeros | Zeros |
MentalHealthDays has 226066 (59.3%) zeros | Zeros |
Reproduction
| Analysis started | 2023-11-20 14:31:43.616747 |
|---|---|
| Analysis finished | 2023-11-20 14:33:47.046204 |
| Duration | 2 minutes and 3.43 seconds |
| Software version | ydata-profiling vv4.5.1 |
| Download configuration | config.json |
State
Text
| Distinct | 54 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 23.7 MiB |
Length
| Max length | 20 |
|---|---|
| Median length | 12 |
| Mean length | 8.3455525 |
| Min length | 4 |
Characters and Unicode
| Total characters | 3179088 |
|---|---|
| Distinct characters | 46 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Alabama |
|---|---|
| 2nd row | Alabama |
| 3rd row | Alabama |
| 4th row | Alabama |
| 5th row | Alabama |
| Value | Count | Frequency (%) |
| new | 30806 | 6.7% |
| washington | 22388 | 4.9% |
| south | 15350 | 3.4% |
| york | 14586 | 3.2% |
| minnesota | 13887 | 3.0% |
| ohio | 13647 | 3.0% |
| maryland | 13362 | 2.9% |
| virginia | 13340 | 2.9% |
| carolina | 12471 | 2.7% |
| texas | 11979 | 2.6% |
| Other values (50) | 294635 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 412273 | |
| i | 301451 | 9.5% |
| n | 283027 | 8.9% |
| o | 270327 | 8.5% |
| s | 223560 | 7.0% |
| e | 183171 | 5.8% |
| r | 160799 | 5.1% |
| t | 146492 | 4.6% |
| h | 107351 | 3.4% |
| l | 91189 | 2.9% |
| Other values (36) | 999448 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2649785 | |
| Uppercase Letter | 453784 | 14.3% |
| Space Separator | 75519 | 2.4% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 412273 | |
| i | 301451 | |
| n | 283027 | |
| o | 270327 | |
| s | 223560 | |
| e | 183171 | 6.9% |
| r | 160799 | 6.1% |
| t | 146492 | 5.5% |
| h | 107351 | 4.1% |
| l | 91189 | 3.4% |
| Other values (14) | 470145 |
Uppercase Letter
| Value | Count | Frequency (%) |
| M | 76342 | |
| N | 48184 | |
| W | 39947 | 8.8% |
| C | 39570 | 8.7% |
| I | 32212 | 7.1% |
| O | 24038 | 5.3% |
| A | 22155 | 4.9% |
| V | 21965 | 4.8% |
| D | 16865 | 3.7% |
| T | 16517 | 3.6% |
| Other values (11) | 115989 |
Space Separator
| Value | Count | Frequency (%) |
| 75519 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 3103569 | |
| Common | 75519 | 2.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 412273 | |
| i | 301451 | 9.7% |
| n | 283027 | 9.1% |
| o | 270327 | 8.7% |
| s | 223560 | 7.2% |
| e | 183171 | 5.9% |
| r | 160799 | 5.2% |
| t | 146492 | 4.7% |
| h | 107351 | 3.5% |
| l | 91189 | 2.9% |
| Other values (35) | 923929 |
Common
| Value | Count | Frequency (%) |
| 75519 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3179088 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 412273 | |
| i | 301451 | 9.5% |
| n | 283027 | 8.9% |
| o | 270327 | 8.5% |
| s | 223560 | 7.0% |
| e | 183171 | 5.8% |
| r | 160799 | 5.1% |
| t | 146492 | 4.6% |
| h | 107351 | 3.4% |
| l | 91189 | 2.9% |
| Other values (36) | 999448 |
Sex
Categorical
HIGH CORRELATION 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 22.5 MiB |
| Female | |
|---|---|
| Male |
Length
| Max length | 6 |
|---|---|
| Median length | 6 |
| Mean length | 5.0579263 |
| Min length | 4 |
Characters and Unicode
| Total characters | 1926726 |
|---|---|
| Distinct characters | 6 |
| Distinct categories | 2 ? |
| Distinct scripts | 1 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Female |
|---|---|
| 2nd row | Female |
| 3rd row | Female |
| 4th row | Female |
| 5th row | Male |
Common Values
| Value | Count | Frequency (%) |
| Female | 201499 | |
| Male | 179433 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| female | 201499 | |
| male | 179433 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 582431 | |
| a | 380932 | |
| l | 380932 | |
| F | 201499 | 10.5% |
| m | 201499 | 10.5% |
| M | 179433 | 9.3% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1545794 | |
| Uppercase Letter | 380932 | 19.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 582431 | |
| a | 380932 | |
| l | 380932 | |
| m | 201499 | 13.0% |
Uppercase Letter
| Value | Count | Frequency (%) |
| F | 201499 | |
| M | 179433 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1926726 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 582431 | |
| a | 380932 | |
| l | 380932 | |
| F | 201499 | 10.5% |
| m | 201499 | 10.5% |
| M | 179433 | 9.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1926726 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 582431 | |
| a | 380932 | |
| l | 380932 | |
| F | 201499 | 10.5% |
| m | 201499 | 10.5% |
| M | 179433 | 9.3% |
GeneralHealth
Categorical
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 957 |
| Missing (%) | 0.3% |
| Memory size | 23.0 MiB |
| Very good | |
|---|---|
| Good | |
| Excellent | |
| Fair | |
| Poor |
Length
| Max length | 9 |
|---|---|
| Median length | 4 |
| Mean length | 6.4694651 |
| Min length | 4 |
Characters and Unicode
| Total characters | 2458235 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 3 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Very good |
|---|---|
| 2nd row | Excellent |
| 3rd row | Excellent |
| 4th row | Fair |
| 5th row | Poor |
Common Values
| Value | Count | Frequency (%) |
| Very good | 127352 | |
| Good | 122852 | |
| Excellent | 60315 | |
| Fair | 52426 | |
| Poor | 17030 | 4.5% |
| (Missing) | 957 | 0.3% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| good | 250204 | |
| very | 127352 | |
| excellent | 60315 | 11.9% |
| fair | 52426 | 10.3% |
| poor | 17030 | 3.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 534468 | |
| d | 250204 | |
| e | 247982 | |
| r | 196808 | 8.0% |
| V | 127352 | 5.2% |
| y | 127352 | 5.2% |
| 127352 | 5.2% | |
| g | 127352 | 5.2% |
| G | 122852 | 5.0% |
| l | 120630 | 4.9% |
| Other values (9) | 475883 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1950908 | |
| Uppercase Letter | 379975 | 15.5% |
| Space Separator | 127352 | 5.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 534468 | |
| d | 250204 | |
| e | 247982 | |
| r | 196808 | 10.1% |
| y | 127352 | 6.5% |
| g | 127352 | 6.5% |
| l | 120630 | 6.2% |
| t | 60315 | 3.1% |
| n | 60315 | 3.1% |
| c | 60315 | 3.1% |
| Other values (3) | 165167 | 8.5% |
Uppercase Letter
| Value | Count | Frequency (%) |
| V | 127352 | |
| G | 122852 | |
| E | 60315 | |
| F | 52426 | |
| P | 17030 | 4.5% |
Space Separator
| Value | Count | Frequency (%) |
| 127352 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2330883 | |
| Common | 127352 | 5.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 534468 | |
| d | 250204 | |
| e | 247982 | |
| r | 196808 | 8.4% |
| V | 127352 | 5.5% |
| y | 127352 | 5.5% |
| g | 127352 | 5.5% |
| G | 122852 | 5.3% |
| l | 120630 | 5.2% |
| t | 60315 | 2.6% |
| Other values (8) | 415568 |
Common
| Value | Count | Frequency (%) |
| 127352 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 2458235 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 534468 | |
| d | 250204 | |
| e | 247982 | |
| r | 196808 | 8.0% |
| V | 127352 | 5.2% |
| y | 127352 | 5.2% |
| 127352 | 5.2% | |
| g | 127352 | 5.2% |
| G | 122852 | 5.0% |
| l | 120630 | 4.9% |
| Other values (9) | 475883 |
PhysicalHealthDays
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8988 |
| Missing (%) | 2.4% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.3849908 |
| Minimum | 0 |
|---|---|
| Maximum | 30 |
| Zeros | 229002 |
| Zeros (%) | 60.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 3 |
| 95-th percentile | 30 |
| Maximum | 30 |
| Range | 30 |
| Interquartile range (IQR) | 3 |
Descriptive statistics
| Standard deviation | 8.7420914 |
|---|---|
| Coefficient of variation (CV) | 1.9936396 |
| Kurtosis | 3.3427103 |
| Mean | 4.3849908 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.1640866 |
| Sum | 1630971 |
| Variance | 76.424162 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 229002 | |
| 30 | 28883 | 7.6% |
| 2 | 21830 | 5.7% |
| 1 | 14936 | 3.9% |
| 3 | 13573 | 3.6% |
| 5 | 13031 | 3.4% |
| 10 | 9008 | 2.4% |
| 7 | 7866 | 2.1% |
| 15 | 7661 | 2.0% |
| 4 | 7212 | 1.9% |
| Other values (21) | 18942 | 5.0% |
| (Missing) | 8988 | 2.4% |
| Value | Count | Frequency (%) |
| 0 | 229002 | |
| 1 | 14936 | 3.9% |
| 2 | 21830 | 5.7% |
| 3 | 13573 | 3.6% |
| 4 | 7212 | 1.9% |
| 5 | 13031 | 3.4% |
| 6 | 2152 | 0.6% |
| 7 | 7866 | 2.1% |
| 8 | 1494 | 0.4% |
| 9 | 333 | 0.1% |
| Value | Count | Frequency (%) |
| 30 | 28883 | |
| 29 | 309 | 0.1% |
| 28 | 635 | 0.2% |
| 27 | 162 | < 0.1% |
| 26 | 92 | < 0.1% |
| 25 | 1876 | 0.5% |
| 24 | 99 | < 0.1% |
| 23 | 86 | < 0.1% |
| 22 | 118 | < 0.1% |
| 21 | 882 | 0.2% |
MentalHealthDays
Real number (ℝ)
MISSING  ZEROS 
| Distinct | 31 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 7420 |
| Missing (%) | 1.9% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.4157323 |
| Minimum | 0 |
|---|---|
| Maximum | 30 |
| Zeros | 226066 |
| Zeros (%) | 59.3% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.9 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 0 |
| median | 0 |
| Q3 | 5 |
| 95-th percentile | 30 |
| Maximum | 30 |
| Range | 30 |
| Interquartile range (IQR) | 5 |
Descriptive statistics
| Standard deviation | 8.4040866 |
|---|---|
| Coefficient of variation (CV) | 1.9032147 |
| Kurtosis | 3.3020549 |
| Mean | 4.4157323 |
| Median Absolute Deviation (MAD) | 0 |
| Skewness | 2.1097337 |
| Sum | 1649329 |
| Variance | 70.628671 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0 | 226066 | |
| 30 | 23209 | 6.1% |
| 2 | 20481 | 5.4% |
| 5 | 17184 | 4.5% |
| 10 | 13325 | 3.5% |
| 3 | 13174 | 3.5% |
| 15 | 12674 | 3.3% |
| 1 | 12440 | 3.3% |
| 20 | 7926 | 2.1% |
| 4 | 6868 | 1.8% |
| Other values (21) | 20165 | 5.3% |
| (Missing) | 7420 | 1.9% |
| Value | Count | Frequency (%) |
| 0 | 226066 | |
| 1 | 12440 | 3.3% |
| 2 | 20481 | 5.4% |
| 3 | 13174 | 3.5% |
| 4 | 6868 | 1.8% |
| 5 | 17184 | 4.5% |
| 6 | 1997 | 0.5% |
| 7 | 6834 | 1.8% |
| 8 | 1476 | 0.4% |
| 9 | 260 | 0.1% |
| Value | Count | Frequency (%) |
| 30 | 23209 | |
| 29 | 418 | 0.1% |
| 28 | 779 | 0.2% |
| 27 | 206 | 0.1% |
| 26 | 90 | < 0.1% |
| 25 | 2669 | 0.7% |
| 24 | 104 | < 0.1% |
| 23 | 86 | < 0.1% |
| 22 | 165 | < 0.1% |
| 21 | 472 | 0.1% |
LastCheckupTime
Categorical
MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6793 |
| Missing (%) | 1.8% |
| Memory size | 38.1 MiB |
| Within past year (anytime less than 12 months ago) | |
|---|---|
| Within past 2 years (1 year but less than 2 years ago) | |
| Within past 5 years (2 years but less than 5 years ago) | 21212 |
| 5 or more years ago | 16314 |
Length
| Max length | 55 |
|---|---|
| Median length | 50 |
| Mean length | 49.311577 |
| Min length | 19 |
Characters and Unicode
| Total characters | 18449384 |
|---|---|
| Distinct characters | 23 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Within past year (anytime less than 12 months ago) |
|---|---|
| 2nd row | Within past year (anytime less than 12 months ago) |
| 3rd row | Within past year (anytime less than 12 months ago) |
| 4th row | Within past year (anytime less than 12 months ago) |
| 5th row | Within past year (anytime less than 12 months ago) |
Common Values
| Value | Count | Frequency (%) |
| Within past year (anytime less than 12 months ago) | 301086 | |
| Within past 2 years (1 year but less than 2 years ago) | 35527 | 9.3% |
| Within past 5 years (2 years but less than 5 years ago) | 21212 | 5.6% |
| 5 or more years ago | 16314 | 4.3% |
| (Missing) | 6793 | 1.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| ago | 374139 | |
| within | 357825 | |
| past | 357825 | |
| less | 357825 | |
| than | 357825 | |
| year | 336613 | |
| anytime | 301086 | |
| 12 | 301086 | |
| months | 301086 | |
| years | 151004 | |
| Other values (6) | 275898 |
Most occurring characters
| Value | Count | Frequency (%) |
| 3098073 | ||
| a | 1878492 | |
| t | 1732386 | |
| s | 1525565 | 8.3% |
| n | 1317822 | 7.1% |
| e | 1162842 | 6.3% |
| h | 1016736 | 5.5% |
| i | 1016736 | 5.5% |
| y | 788703 | 4.3% |
| o | 707853 | 3.8% |
| Other values (13) | 4204176 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 13489133 | |
| Space Separator | 3098073 | 16.8% |
| Decimal Number | 788703 | 4.3% |
| Close Punctuation | 357825 | 1.9% |
| Uppercase Letter | 357825 | 1.9% |
| Open Punctuation | 357825 | 1.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 1878492 | |
| t | 1732386 | |
| s | 1525565 | |
| n | 1317822 | |
| e | 1162842 | |
| h | 1016736 | |
| i | 1016736 | |
| y | 788703 | |
| o | 707853 | 5.2% |
| m | 618486 | 4.6% |
| Other values (6) | 1723512 |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 393352 | |
| 1 | 336613 | |
| 5 | 58738 | 7.4% |
Space Separator
| Value | Count | Frequency (%) |
| 3098073 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 357825 |
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 357825 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 357825 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 13846958 | |
| Common | 4602426 | 24.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 1878492 | |
| t | 1732386 | |
| s | 1525565 | |
| n | 1317822 | |
| e | 1162842 | |
| h | 1016736 | |
| i | 1016736 | |
| y | 788703 | 5.7% |
| o | 707853 | 5.1% |
| m | 618486 | 4.5% |
| Other values (7) | 2081337 |
Common
| Value | Count | Frequency (%) |
| 3098073 | ||
| 2 | 393352 | 8.5% |
| ) | 357825 | 7.8% |
| ( | 357825 | 7.8% |
| 1 | 336613 | 7.3% |
| 5 | 58738 | 1.3% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18449384 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 3098073 | ||
| a | 1878492 | |
| t | 1732386 | |
| s | 1525565 | 8.3% |
| n | 1317822 | 7.1% |
| e | 1162842 | 6.3% |
| h | 1016736 | 5.5% |
| i | 1016736 | 5.5% |
| y | 788703 | 4.3% |
| o | 707853 | 3.8% |
| Other values (13) | 4204176 |
PhysicalActivities
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 805 |
| Missing (%) | 0.2% |
| Memory size | 744.1 KiB |
| True | |
|---|---|
| False | |
| (Missing) | 805 |
| Value | Count | Frequency (%) |
| True | 288481 | |
| False | 91646 | 24.1% |
| (Missing) | 805 | 0.2% |
SleepHours
Real number (ℝ)
MISSING 
| Distinct | 24 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4376 |
| Missing (%) | 1.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 7.0228226 |
| Minimum | 1 |
|---|---|
| Maximum | 24 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.9 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 5 |
| Q1 | 6 |
| median | 7 |
| Q3 | 8 |
| 95-th percentile | 9 |
| Maximum | 24 |
| Range | 23 |
| Interquartile range (IQR) | 2 |
Descriptive statistics
| Standard deviation | 1.4911012 |
|---|---|
| Coefficient of variation (CV) | 0.2123222 |
| Kurtosis | 8.0076209 |
| Mean | 7.0228226 |
| Median Absolute Deviation (MAD) | 1 |
| Skewness | 0.69860711 |
| Sum | 2644486 |
| Variance | 2.2233827 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7 | 113898 | |
| 8 | 106981 | |
| 6 | 82268 | |
| 5 | 25944 | 6.8% |
| 9 | 18421 | 4.8% |
| 4 | 10642 | 2.8% |
| 10 | 9046 | 2.4% |
| 3 | 2764 | 0.7% |
| 12 | 2561 | 0.7% |
| 2 | 1267 | 0.3% |
| Other values (14) | 2764 | 0.7% |
| (Missing) | 4376 | 1.1% |
| Value | Count | Frequency (%) |
| 1 | 911 | 0.2% |
| 2 | 1267 | 0.3% |
| 3 | 2764 | 0.7% |
| 4 | 10642 | 2.8% |
| 5 | 25944 | 6.8% |
| 6 | 82268 | |
| 7 | 113898 | |
| 8 | 106981 | |
| 9 | 18421 | 4.8% |
| 10 | 9046 | 2.4% |
| Value | Count | Frequency (%) |
| 24 | 34 | < 0.1% |
| 23 | 11 | < 0.1% |
| 22 | 13 | < 0.1% |
| 21 | 2 | < 0.1% |
| 20 | 113 | |
| 19 | 13 | < 0.1% |
| 18 | 143 | |
| 17 | 21 | < 0.1% |
| 16 | 258 | |
| 15 | 262 |
RemovedTeeth
Categorical
MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 9230 |
| Missing (%) | 2.4% |
| Memory size | 24.3 MiB |
| None of them | |
|---|---|
| 1 to 5 | |
| 6 or more, but not all | |
| All |
Length
| Max length | 22 |
|---|---|
| Median length | 12 |
| Mean length | 10.739972 |
| Min length | 3 |
Characters and Unicode
| Total characters | 3992069 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | None of them |
|---|---|
| 2nd row | None of them |
| 3rd row | 1 to 5 |
| 4th row | 1 to 5 |
| 5th row | None of them |
Common Values
| Value | Count | Frequency (%) |
| None of them | 198338 | |
| 1 to 5 | 111375 | |
| 6 or more, but not all | 39884 | 10.5% |
| All | 22105 | 5.8% |
| (Missing) | 9230 | 2.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| none | 198338 | |
| of | 198338 | |
| them | 198338 | |
| 1 | 111375 | |
| to | 111375 | |
| 5 | 111375 | |
| all | 61989 | 5.2% |
| 6 | 39884 | 3.4% |
| or | 39884 | 3.4% |
| more | 39884 | 3.4% |
| Other values (2) | 79768 |
Most occurring characters
| Value | Count | Frequency (%) |
| 818846 | ||
| o | 627703 | |
| e | 436560 | |
| t | 389481 | |
| n | 238222 | 6.0% |
| m | 238222 | 6.0% |
| N | 198338 | 5.0% |
| f | 198338 | 5.0% |
| h | 198338 | 5.0% |
| l | 123978 | 3.1% |
| Other values (9) | 524043 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 2650262 | |
| Space Separator | 818846 | 20.5% |
| Decimal Number | 262634 | 6.6% |
| Uppercase Letter | 220443 | 5.5% |
| Other Punctuation | 39884 | 1.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 627703 | |
| e | 436560 | |
| t | 389481 | |
| n | 238222 | 9.0% |
| m | 238222 | 9.0% |
| f | 198338 | 7.5% |
| h | 198338 | 7.5% |
| l | 123978 | 4.7% |
| r | 79768 | 3.0% |
| b | 39884 | 1.5% |
| Other values (2) | 79768 | 3.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 111375 | |
| 5 | 111375 | |
| 6 | 39884 | 15.2% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 198338 | |
| A | 22105 | 10.0% |
Space Separator
| Value | Count | Frequency (%) |
| 818846 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 39884 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 2870705 | |
| Common | 1121364 | 28.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 627703 | |
| e | 436560 | |
| t | 389481 | |
| n | 238222 | 8.3% |
| m | 238222 | 8.3% |
| N | 198338 | 6.9% |
| f | 198338 | 6.9% |
| h | 198338 | 6.9% |
| l | 123978 | 4.3% |
| r | 79768 | 2.8% |
| Other values (4) | 141757 | 4.9% |
Common
| Value | Count | Frequency (%) |
| 818846 | ||
| 1 | 111375 | 9.9% |
| 5 | 111375 | 9.9% |
| 6 | 39884 | 3.6% |
| , | 39884 | 3.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 3992069 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 818846 | ||
| o | 627703 | |
| e | 436560 | |
| t | 389481 | |
| n | 238222 | 6.0% |
| m | 238222 | 6.0% |
| N | 198338 | 5.0% |
| f | 198338 | 5.0% |
| h | 198338 | 5.0% |
| l | 123978 | 3.1% |
| Other values (9) | 524043 |
HadHeartAttack
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2377 |
| Missing (%) | 0.6% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | 21970 |
| (Missing) | 2377 |
| Value | Count | Frequency (%) |
| False | 356585 | |
| True | 21970 | 5.8% |
| (Missing) | 2377 | 0.6% |
HadAngina
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 3647 |
| Missing (%) | 1.0% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | 23395 |
| (Missing) | 3647 |
| Value | Count | Frequency (%) |
| False | 353890 | |
| True | 23395 | 6.1% |
| (Missing) | 3647 | 1.0% |
HadStroke
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1184 |
| Missing (%) | 0.3% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | 16779 |
| (Missing) | 1184 |
| Value | Count | Frequency (%) |
| False | 362969 | |
| True | 16779 | 4.4% |
| (Missing) | 1184 | 0.3% |
HadAsthma
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1377 |
| Missing (%) | 0.4% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | |
| (Missing) | 1377 |
| Value | Count | Frequency (%) |
| False | 321806 | |
| True | 57749 | 15.2% |
| (Missing) | 1377 | 0.4% |
HadSkinCancer
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2573 |
| Missing (%) | 0.7% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | 31161 |
| (Missing) | 2573 |
| Value | Count | Frequency (%) |
| False | 347198 | |
| True | 31161 | 8.2% |
| (Missing) | 2573 | 0.7% |
HadCOPD
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1726 |
| Missing (%) | 0.5% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | 31165 |
| (Missing) | 1726 |
| Value | Count | Frequency (%) |
| False | 348041 | |
| True | 31165 | 8.2% |
| (Missing) | 1726 | 0.5% |
HadDepressiveDisorder
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2222 |
| Missing (%) | 0.6% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | |
| (Missing) | 2222 |
| Value | Count | Frequency (%) |
| False | 298652 | |
| True | 80058 | 21.0% |
| (Missing) | 2222 | 0.6% |
HadKidneyDisease
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1490 |
| Missing (%) | 0.4% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | 17909 |
| (Missing) | 1490 |
| Value | Count | Frequency (%) |
| False | 361533 | |
| True | 17909 | 4.7% |
| (Missing) | 1490 | 0.4% |
HadArthritis
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2092 |
| Missing (%) | 0.5% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | |
| (Missing) | 2092 |
| Value | Count | Frequency (%) |
| False | 246483 | |
| True | 132357 | |
| (Missing) | 2092 | 0.5% |
HadDiabetes
Categorical
IMBALANCE 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 754 |
| Missing (%) | 0.2% |
| Memory size | 21.9 MiB |
| No | |
|---|---|
| Yes | |
| No, pre-diabetes or borderline diabetes | 9054 |
| Yes, but only during pregnancy (female) | 3268 |
Length
| Max length | 39 |
|---|---|
| Median length | 2 |
| Mean length | 3.339733 |
| Min length | 2 |
Characters and Unicode
| Total characters | 1269693 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 7 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Yes |
|---|---|
| 2nd row | No |
| 3rd row | No |
| 4th row | No |
| 5th row | Yes |
Common Values
| Value | Count | Frequency (%) |
| No | 314433 | |
| Yes | 53423 | 14.0% |
| No, pre-diabetes or borderline diabetes | 9054 | 2.4% |
| Yes, but only during pregnancy (female) | 3268 | 0.9% |
| (Missing) | 754 | 0.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| no | 323487 | |
| yes | 56691 | 13.1% |
| pre-diabetes | 9054 | 2.1% |
| or | 9054 | 2.1% |
| borderline | 9054 | 2.1% |
| diabetes | 9054 | 2.1% |
| but | 3268 | 0.8% |
| only | 3268 | 0.8% |
| during | 3268 | 0.8% |
| pregnancy | 3268 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 344863 | |
| N | 323487 | |
| e | 129873 | 10.2% |
| s | 74799 | 5.9% |
| Y | 56691 | 4.5% |
| 52556 | 4.1% | |
| r | 42752 | 3.4% |
| d | 30430 | 2.4% |
| b | 30430 | 2.4% |
| i | 30430 | 2.4% |
| Other values (15) | 153382 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 809047 | |
| Uppercase Letter | 380178 | |
| Space Separator | 52556 | 4.1% |
| Other Punctuation | 12322 | 1.0% |
| Dash Punctuation | 9054 | 0.7% |
| Open Punctuation | 3268 | 0.3% |
| Close Punctuation | 3268 | 0.3% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 344863 | |
| e | 129873 | 16.1% |
| s | 74799 | 9.2% |
| r | 42752 | 5.3% |
| d | 30430 | 3.8% |
| b | 30430 | 3.8% |
| i | 30430 | 3.8% |
| a | 24644 | 3.0% |
| n | 22126 | 2.7% |
| t | 21376 | 2.6% |
| Other values (8) | 57324 | 7.1% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 323487 | |
| Y | 56691 | 14.9% |
Space Separator
| Value | Count | Frequency (%) |
| 52556 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 12322 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 9054 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 3268 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 3268 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 1189225 | |
| Common | 80468 | 6.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 344863 | |
| N | 323487 | |
| e | 129873 | 10.9% |
| s | 74799 | 6.3% |
| Y | 56691 | 4.8% |
| r | 42752 | 3.6% |
| d | 30430 | 2.6% |
| b | 30430 | 2.6% |
| i | 30430 | 2.6% |
| a | 24644 | 2.1% |
| Other values (10) | 100826 | 8.5% |
Common
| Value | Count | Frequency (%) |
| 52556 | ||
| , | 12322 | 15.3% |
| - | 9054 | 11.3% |
| ( | 3268 | 4.1% |
| ) | 3268 | 4.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 1269693 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 344863 | |
| N | 323487 | |
| e | 129873 | 10.2% |
| s | 74799 | 5.9% |
| Y | 56691 | 4.5% |
| 52556 | 4.1% | |
| r | 42752 | 3.4% |
| d | 30430 | 2.4% |
| b | 30430 | 2.4% |
| i | 30430 | 2.4% |
| Other values (15) | 153382 |
DeafOrHardOfHearing
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1388 |
| Missing (%) | 0.4% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | |
| (Missing) | 1388 |
| Value | Count | Frequency (%) |
| False | 344325 | |
| True | 35219 | 9.2% |
| (Missing) | 1388 | 0.4% |
BlindOrVisionDifficulty
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1178 |
| Missing (%) | 0.3% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | 21459 |
| (Missing) | 1178 |
| Value | Count | Frequency (%) |
| False | 358295 | |
| True | 21459 | 5.6% |
| (Missing) | 1178 | 0.3% |
DifficultyConcentrating
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2540 |
| Missing (%) | 0.7% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | |
| (Missing) | 2540 |
| Value | Count | Frequency (%) |
| False | 332695 | |
| True | 45697 | 12.0% |
| (Missing) | 2540 | 0.7% |
DifficultyWalking
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1336 |
| Missing (%) | 0.4% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | |
| (Missing) | 1336 |
| Value | Count | Frequency (%) |
| False | 317250 | |
| True | 62346 | 16.4% |
| (Missing) | 1336 | 0.4% |
DifficultyDressingBathing
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 594 |
| Missing (%) | 0.2% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | 15356 |
| (Missing) | 594 |
| Value | Count | Frequency (%) |
| False | 364982 | |
| True | 15356 | 4.0% |
| (Missing) | 594 | 0.2% |
DifficultyErrands
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1207 |
| Missing (%) | 0.3% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | 29805 |
| (Missing) | 1207 |
| Value | Count | Frequency (%) |
| False | 349920 | |
| True | 29805 | 7.8% |
| (Missing) | 1207 | 0.3% |
SmokerStatus
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2777 |
| Missing (%) | 0.7% |
| Memory size | 26.2 MiB |
| Never smoked | |
|---|---|
| Former smoker | |
| Current smoker - now smokes every day | |
| Current smoker - now smokes some days | 12658 |
Length
| Max length | 37 |
|---|---|
| Median length | 12 |
| Mean length | 15.307731 |
| Min length | 12 |
Characters and Unicode
| Total characters | 5788695 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Never smoked |
|---|---|
| 2nd row | Never smoked |
| 3rd row | Current smoker - now smokes some days |
| 4th row | Never smoked |
| 5th row | Never smoked |
Common Values
| Value | Count | Frequency (%) |
| Never smoked | 227504 | |
| Former smoker | 104810 | |
| Current smoker - now smokes every day | 33183 | 8.7% |
| Current smoker - now smokes some days | 12658 | 3.3% |
| (Missing) | 2777 | 0.7% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| never | 227504 | |
| smoked | 227504 | |
| smoker | 150651 | |
| former | 104810 | |
| current | 45841 | 4.7% |
| 45841 | 4.7% | |
| now | 45841 | 4.7% |
| smokes | 45841 | 4.7% |
| every | 33183 | 3.4% |
| day | 33183 | 3.4% |
| Other values (2) | 25316 | 2.6% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 1108679 | |
| r | 712640 | |
| 607360 | ||
| o | 587305 | |
| m | 541464 | |
| s | 495153 | |
| k | 423996 | 7.3% |
| d | 273345 | 4.7% |
| v | 260687 | 4.5% |
| N | 227504 | 3.9% |
| Other values (9) | 550562 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 4757339 | |
| Space Separator | 607360 | 10.5% |
| Uppercase Letter | 378155 | 6.5% |
| Dash Punctuation | 45841 | 0.8% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1108679 | |
| r | 712640 | |
| o | 587305 | |
| m | 541464 | |
| s | 495153 | |
| k | 423996 | 8.9% |
| d | 273345 | 5.7% |
| v | 260687 | 5.5% |
| n | 91682 | 1.9% |
| y | 79024 | 1.7% |
| Other values (4) | 183364 | 3.9% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 227504 | |
| F | 104810 | |
| C | 45841 | 12.1% |
Space Separator
| Value | Count | Frequency (%) |
| 607360 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 45841 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 5135494 | |
| Common | 653201 | 11.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1108679 | |
| r | 712640 | |
| o | 587305 | |
| m | 541464 | |
| s | 495153 | |
| k | 423996 | 8.3% |
| d | 273345 | 5.3% |
| v | 260687 | 5.1% |
| N | 227504 | 4.4% |
| F | 104810 | 2.0% |
| Other values (7) | 399911 | 7.8% |
Common
| Value | Count | Frequency (%) |
| 607360 | ||
| - | 45841 | 7.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 5788695 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 1108679 | |
| r | 712640 | |
| 607360 | ||
| o | 587305 | |
| m | 541464 | |
| s | 495153 | |
| k | 423996 | 7.3% |
| d | 273345 | 4.7% |
| v | 260687 | 4.5% |
| N | 227504 | 3.9% |
| Other values (9) | 550562 |
ECigaretteUsage
Categorical
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1519 |
| Missing (%) | 0.4% |
| Memory size | 33.8 MiB |
| Never used e-cigarettes in my entire life | |
|---|---|
| Not at all (right now) | |
| Use them some days | 10727 |
| Use them every day | 9484 |
Length
| Max length | 41 |
|---|---|
| Median length | 41 |
| Mean length | 36.297939 |
| Min length | 18 |
Characters and Unicode
| Total characters | 13771910 |
|---|---|
| Distinct characters | 25 |
| Distinct categories | 6 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Not at all (right now) |
|---|---|
| 2nd row | Never used e-cigarettes in my entire life |
| 3rd row | Never used e-cigarettes in my entire life |
| 4th row | Never used e-cigarettes in my entire life |
| 5th row | Never used e-cigarettes in my entire life |
Common Values
| Value | Count | Frequency (%) |
| Never used e-cigarettes in my entire life | 289772 | |
| Not at all (right now) | 69430 | 18.2% |
| Use them some days | 10727 | 2.8% |
| Use them every day | 9484 | 2.5% |
| (Missing) | 1519 | 0.4% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| never | 289772 | |
| used | 289772 | |
| e-cigarettes | 289772 | |
| in | 289772 | |
| my | 289772 | |
| entire | 289772 | |
| life | 289772 | |
| now | 69430 | 2.8% |
| right | 69430 | 2.8% |
| all | 69430 | 2.8% |
| Other values (8) | 219704 |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 2678065 | |
| 2076985 | ||
| i | 1228518 | 8.9% |
| t | 1097817 | 8.0% |
| r | 948230 | 6.9% |
| n | 648974 | 4.7% |
| s | 621209 | 4.5% |
| a | 448843 | 3.3% |
| l | 428632 | 3.1% |
| g | 359202 | 2.6% |
| Other values (15) | 3235435 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10886880 | |
| Space Separator | 2076985 | 15.1% |
| Uppercase Letter | 379413 | 2.8% |
| Dash Punctuation | 289772 | 2.1% |
| Open Punctuation | 69430 | 0.5% |
| Close Punctuation | 69430 | 0.5% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 2678065 | |
| i | 1228518 | |
| t | 1097817 | |
| r | 948230 | 8.7% |
| n | 648974 | 6.0% |
| s | 621209 | 5.7% |
| a | 448843 | 4.1% |
| l | 428632 | 3.9% |
| g | 359202 | 3.3% |
| m | 320710 | 2.9% |
| Other values (9) | 2106680 |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 359202 | |
| U | 20211 | 5.3% |
Space Separator
| Value | Count | Frequency (%) |
| 2076985 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 289772 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 69430 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 69430 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11266293 | |
| Common | 2505617 | 18.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 2678065 | |
| i | 1228518 | |
| t | 1097817 | |
| r | 948230 | 8.4% |
| n | 648974 | 5.8% |
| s | 621209 | 5.5% |
| a | 448843 | 4.0% |
| l | 428632 | 3.8% |
| g | 359202 | 3.2% |
| N | 359202 | 3.2% |
| Other values (11) | 2447601 |
Common
| Value | Count | Frequency (%) |
| 2076985 | ||
| - | 289772 | 11.6% |
| ( | 69430 | 2.8% |
| ) | 69430 | 2.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 13771910 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 2678065 | |
| 2076985 | ||
| i | 1228518 | 8.9% |
| t | 1097817 | 8.0% |
| r | 948230 | 6.9% |
| n | 648974 | 4.7% |
| s | 621209 | 4.5% |
| a | 448843 | 3.3% |
| l | 428632 | 3.1% |
| g | 359202 | 2.6% |
| Other values (15) | 3235435 |
ChestScan
Boolean
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 16179 |
| Missing (%) | 4.2% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | |
| (Missing) | 16179 |
| Value | Count | Frequency (%) |
| False | 207889 | |
| True | 156864 | |
| (Missing) | 16179 | 4.2% |
RaceEthnicityCategory
Categorical
MISSING 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 10780 |
| Missing (%) | 2.8% |
| Memory size | 28.5 MiB |
| White only, Non-Hispanic | |
|---|---|
| Hispanic | |
| Black only, Non-Hispanic | |
| Other race only, Non-Hispanic | 18858 |
| Multiracial, Non-Hispanic | 8339 |
Length
| Max length | 29 |
|---|---|
| Median length | 24 |
| Mean length | 22.70031 |
| Min length | 8 |
Characters and Unicode
| Total characters | 8402565 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | White only, Non-Hispanic |
|---|---|
| 2nd row | White only, Non-Hispanic |
| 3rd row | White only, Non-Hispanic |
| 4th row | White only, Non-Hispanic |
| 5th row | White only, Non-Hispanic |
Common Values
| Value | Count | Frequency (%) |
| White only, Non-Hispanic | 277070 | |
| Hispanic | 36482 | 9.6% |
| Black only, Non-Hispanic | 29403 | 7.7% |
| Other race only, Non-Hispanic | 18858 | 5.0% |
| Multiracial, Non-Hispanic | 8339 | 2.2% |
| (Missing) | 10780 | 2.8% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| non-hispanic | 333670 | |
| only | 325331 | |
| white | 277070 | |
| hispanic | 36482 | 3.5% |
| black | 29403 | 2.8% |
| other | 18858 | 1.8% |
| race | 18858 | 1.8% |
| multiracial | 8339 | 0.8% |
Most occurring characters
| Value | Count | Frequency (%) |
| i | 1034052 | 12.3% |
| n | 1029153 | 12.2% |
| 677859 | 8.1% | |
| o | 659001 | 7.8% |
| a | 435091 | 5.2% |
| c | 426752 | 5.1% |
| l | 371412 | 4.4% |
| H | 370152 | 4.4% |
| s | 370152 | 4.4% |
| p | 370152 | 4.4% |
| Other values (14) | 2658789 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 6019874 | |
| Uppercase Letter | 1037492 | 12.3% |
| Space Separator | 677859 | 8.1% |
| Other Punctuation | 333670 | 4.0% |
| Dash Punctuation | 333670 | 4.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1034052 | |
| n | 1029153 | |
| o | 659001 | |
| a | 435091 | |
| c | 426752 | |
| l | 371412 | 6.2% |
| s | 370152 | 6.1% |
| p | 370152 | 6.1% |
| y | 325331 | 5.4% |
| e | 314786 | 5.2% |
| Other values (5) | 683992 |
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 370152 | |
| N | 333670 | |
| W | 277070 | |
| B | 29403 | 2.8% |
| O | 18858 | 1.8% |
| M | 8339 | 0.8% |
Space Separator
| Value | Count | Frequency (%) |
| 677859 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 333670 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 333670 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 7057366 | |
| Common | 1345199 | 16.0% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| i | 1034052 | |
| n | 1029153 | |
| o | 659001 | 9.3% |
| a | 435091 | 6.2% |
| c | 426752 | 6.0% |
| l | 371412 | 5.3% |
| H | 370152 | 5.2% |
| s | 370152 | 5.2% |
| p | 370152 | 5.2% |
| N | 333670 | 4.7% |
| Other values (11) | 1657779 |
Common
| Value | Count | Frequency (%) |
| 677859 | ||
| , | 333670 | |
| - | 333670 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 8402565 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| i | 1034052 | 12.3% |
| n | 1029153 | 12.2% |
| 677859 | 8.1% | |
| o | 659001 | 7.8% |
| a | 435091 | 5.2% |
| c | 426752 | 5.1% |
| l | 371412 | 4.4% |
| H | 370152 | 4.4% |
| s | 370152 | 4.4% |
| p | 370152 | 4.4% |
| Other values (14) | 2658789 |
AgeCategory
Categorical
HIGH CORRELATION  MISSING 
| Distinct | 13 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 6348 |
| Missing (%) | 1.7% |
| Memory size | 24.9 MiB |
| Age 65 to 69 | |
|---|---|
| Age 70 to 74 | |
| Age 60 to 64 | |
| Age 80 or older | |
| Age 55 to 59 | |
| Other values (8) |
Length
| Max length | 15 |
|---|---|
| Median length | 12 |
| Mean length | 12.255195 |
| Min length | 12 |
Characters and Unicode
| Total characters | 4590600 |
|---|---|
| Distinct characters | 19 |
| Distinct categories | 4 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Age 80 or older |
|---|---|
| 2nd row | Age 80 or older |
| 3rd row | Age 40 to 44 |
| 4th row | Age 80 or older |
| 5th row | Age 80 or older |
Common Values
| Value | Count | Frequency (%) |
| Age 65 to 69 | 41071 | |
| Age 70 to 74 | 38192 | |
| Age 60 to 64 | 38166 | |
| Age 80 or older | 31864 | |
| Age 55 to 59 | 31423 | |
| Age 75 to 79 | 28661 | |
| Age 50 to 54 | 28448 | |
| Age 40 to 44 | 25065 | 6.6% |
| Age 45 to 49 | 24036 | 6.3% |
| Age 35 to 39 | 23980 | 6.3% |
| Other values (3) | 63678 |
Length
| Value | Count | Frequency (%) |
| age | 374584 | |
| to | 342720 | |
| 65 | 41071 | 2.7% |
| 69 | 41071 | 2.7% |
| 70 | 38192 | 2.5% |
| 74 | 38192 | 2.5% |
| 60 | 38166 | 2.5% |
| 64 | 38166 | 2.5% |
| 80 | 31864 | 2.1% |
| or | 31864 | 2.1% |
| Other values (19) | 482446 |
Most occurring characters
| Value | Count | Frequency (%) |
| 1123752 | ||
| e | 406448 | 8.9% |
| o | 406448 | 8.9% |
| A | 374584 | 8.2% |
| g | 374584 | 8.2% |
| t | 342720 | 7.5% |
| 5 | 287767 | 6.3% |
| 4 | 272897 | 5.9% |
| 0 | 183403 | 4.0% |
| 9 | 168025 | 3.7% |
| Other values (9) | 649972 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 1657656 | |
| Decimal Number | 1434608 | |
| Space Separator | 1123752 | |
| Uppercase Letter | 374584 | 8.2% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 287767 | |
| 4 | 272897 | |
| 0 | 183403 | |
| 9 | 168025 | |
| 6 | 158474 | |
| 7 | 133706 | |
| 3 | 91296 | 6.4% |
| 2 | 60864 | 4.2% |
| 8 | 55020 | 3.8% |
| 1 | 23156 | 1.6% |
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 406448 | |
| o | 406448 | |
| g | 374584 | |
| t | 342720 | |
| r | 63728 | 3.8% |
| l | 31864 | 1.9% |
| d | 31864 | 1.9% |
Space Separator
| Value | Count | Frequency (%) |
| 1123752 |
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 374584 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 2558360 | |
| Latin | 2032240 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 1123752 | ||
| 5 | 287767 | 11.2% |
| 4 | 272897 | 10.7% |
| 0 | 183403 | 7.2% |
| 9 | 168025 | 6.6% |
| 6 | 158474 | 6.2% |
| 7 | 133706 | 5.2% |
| 3 | 91296 | 3.6% |
| 2 | 60864 | 2.4% |
| 8 | 55020 | 2.2% |
Latin
| Value | Count | Frequency (%) |
| e | 406448 | |
| o | 406448 | |
| A | 374584 | |
| g | 374584 | |
| t | 342720 | |
| r | 63728 | 3.1% |
| l | 31864 | 1.6% |
| d | 31864 | 1.6% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 4590600 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 1123752 | ||
| e | 406448 | 8.9% |
| o | 406448 | 8.9% |
| A | 374584 | 8.2% |
| g | 374584 | 8.2% |
| t | 342720 | 7.5% |
| 5 | 287767 | 6.3% |
| 4 | 272897 | 5.9% |
| 0 | 183403 | 4.0% |
| 9 | 168025 | 3.7% |
| Other values (9) | 649972 |
HeightInMeters
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 108 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 8631 |
| Missing (%) | 2.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 1.7025712 |
| Minimum | 0.91 |
|---|---|
| Maximum | 2.41 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.9 MiB |
Quantile statistics
| Minimum | 0.91 |
|---|---|
| 5-th percentile | 1.52 |
| Q1 | 1.63 |
| median | 1.7 |
| Q3 | 1.78 |
| 95-th percentile | 1.88 |
| Maximum | 2.41 |
| Range | 1.5 |
| Interquartile range (IQR) | 0.15 |
Descriptive statistics
| Standard deviation | 0.10717064 |
|---|---|
| Coefficient of variation (CV) | 0.062946351 |
| Kurtosis | 0.14597114 |
| Mean | 1.7025712 |
| Median Absolute Deviation (MAD) | 0.08 |
| Skewness | 0.025750182 |
| Sum | 633868.95 |
| Variance | 0.011485547 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.68 | 32889 | 8.6% |
| 1.63 | 31802 | 8.3% |
| 1.7 | 30340 | 8.0% |
| 1.65 | 29211 | 7.7% |
| 1.78 | 28710 | 7.5% |
| 1.73 | 27642 | 7.3% |
| 1.75 | 25991 | 6.8% |
| 1.6 | 25347 | 6.7% |
| 1.83 | 25172 | 6.6% |
| 1.57 | 24093 | 6.3% |
| Other values (98) | 91104 |
| Value | Count | Frequency (%) |
| 0.91 | 18 | |
| 0.92 | 1 | < 0.1% |
| 0.95 | 1 | < 0.1% |
| 0.97 | 4 | < 0.1% |
| 0.99 | 1 | < 0.1% |
| 1 | 4 | < 0.1% |
| 1.02 | 2 | < 0.1% |
| 1.03 | 2 | < 0.1% |
| 1.04 | 18 | |
| 1.05 | 26 |
| Value | Count | Frequency (%) |
| 2.41 | 4 | < 0.1% |
| 2.36 | 1 | < 0.1% |
| 2.34 | 2 | < 0.1% |
| 2.29 | 5 | < 0.1% |
| 2.26 | 10 | |
| 2.24 | 1 | < 0.1% |
| 2.21 | 6 | < 0.1% |
| 2.18 | 6 | < 0.1% |
| 2.16 | 9 | |
| 2.13 | 20 |
WeightInKilograms
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 584 |
|---|---|
| Distinct (%) | 0.2% |
| Missing | 20361 |
| Missing (%) | 5.3% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 83.217059 |
| Minimum | 22.68 |
|---|---|
| Maximum | 292.57 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.9 MiB |
Quantile statistics
| Minimum | 22.68 |
|---|---|
| 5-th percentile | 54.43 |
| Q1 | 68.04 |
| median | 81.19 |
| Q3 | 95.25 |
| 95-th percentile | 122.47 |
| Maximum | 292.57 |
| Range | 269.89 |
| Interquartile range (IQR) | 27.21 |
Descriptive statistics
| Standard deviation | 21.485738 |
|---|---|
| Coefficient of variation (CV) | 0.2581891 |
| Kurtosis | 2.6537337 |
| Mean | 83.217059 |
| Median Absolute Deviation (MAD) | 13.15 |
| Skewness | 1.0632404 |
| Sum | 30005658 |
| Variance | 461.63692 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90.72 | 18980 | 5.0% |
| 81.65 | 17515 | 4.6% |
| 68.04 | 15508 | 4.1% |
| 72.57 | 15298 | 4.0% |
| 77.11 | 14218 | 3.7% |
| 86.18 | 12711 | 3.3% |
| 63.5 | 11408 | 3.0% |
| 79.38 | 10405 | 2.7% |
| 99.79 | 9759 | 2.6% |
| 74.84 | 9697 | 2.5% |
| Other values (574) | 225072 | |
| (Missing) | 20361 | 5.3% |
| Value | Count | Frequency (%) |
| 22.68 | 8 | |
| 23 | 1 | < 0.1% |
| 23.13 | 1 | < 0.1% |
| 23.59 | 1 | < 0.1% |
| 24 | 1 | < 0.1% |
| 24.04 | 2 | < 0.1% |
| 24.49 | 1 | < 0.1% |
| 24.95 | 5 | |
| 25.4 | 3 | < 0.1% |
| 25.85 | 3 | < 0.1% |
| Value | Count | Frequency (%) |
| 292.57 | 1 | |
| 290.3 | 1 | |
| 285 | 1 | |
| 281.68 | 1 | |
| 281 | 1 | |
| 280.32 | 1 | |
| 280 | 1 | |
| 278.96 | 1 | |
| 276.24 | 1 | |
| 274.42 | 1 |
BMI
Real number (ℝ)
HIGH CORRELATION  MISSING 
| Distinct | 3887 |
|---|---|
| Distinct (%) | 1.1% |
| Missing | 25608 |
| Missing (%) | 6.7% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 28.586092 |
| Minimum | 12.02 |
|---|---|
| Maximum | 99.64 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 2.9 MiB |
Quantile statistics
| Minimum | 12.02 |
|---|---|
| 5-th percentile | 20.16 |
| Q1 | 24.14 |
| median | 27.44 |
| Q3 | 31.84 |
| 95-th percentile | 40.72 |
| Maximum | 99.64 |
| Range | 87.62 |
| Interquartile range (IQR) | 7.7 |
Descriptive statistics
| Standard deviation | 6.5714124 |
|---|---|
| Coefficient of variation (CV) | 0.22988146 |
| Kurtosis | 4.1968959 |
| Mean | 28.586092 |
| Median Absolute Deviation (MAD) | 3.74 |
| Skewness | 1.3606458 |
| Sum | 10157324 |
| Variance | 43.183461 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 26.63 | 3757 | 1.0% |
| 27.46 | 2945 | 0.8% |
| 24.41 | 2858 | 0.8% |
| 27.44 | 2787 | 0.7% |
| 27.12 | 2738 | 0.7% |
| 25.1 | 2435 | 0.6% |
| 32.28 | 2176 | 0.6% |
| 29.53 | 2081 | 0.5% |
| 29.29 | 2067 | 0.5% |
| 25.84 | 2062 | 0.5% |
| Other values (3877) | 329418 | |
| (Missing) | 25608 | 6.7% |
| Value | Count | Frequency (%) |
| 12.02 | 1 | < 0.1% |
| 12.05 | 1 | < 0.1% |
| 12.06 | 1 | < 0.1% |
| 12.11 | 3 | |
| 12.16 | 4 | |
| 12.19 | 1 | < 0.1% |
| 12.21 | 3 | |
| 12.24 | 1 | < 0.1% |
| 12.27 | 3 | |
| 12.3 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 99.64 | 1 | < 0.1% |
| 97.65 | 4 | |
| 97.43 | 1 | < 0.1% |
| 96.2 | 1 | < 0.1% |
| 95.66 | 2 | |
| 94.66 | 1 | < 0.1% |
| 93.88 | 2 | |
| 93.51 | 1 | < 0.1% |
| 93.41 | 1 | < 0.1% |
| 92.73 | 1 | < 0.1% |
AlcoholDrinkers
Boolean
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 4901 |
| Missing (%) | 1.3% |
| Memory size | 744.1 KiB |
| True | |
|---|---|
| False | |
| (Missing) | 4901 |
| Value | Count | Frequency (%) |
| True | 196484 | |
| False | 179547 | |
| (Missing) | 4901 | 1.3% |
HIVTesting
Boolean
MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18321 |
| Missing (%) | 4.8% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | |
| (Missing) | 18321 |
| Value | Count | Frequency (%) |
| False | 239602 | |
| True | 123009 | |
| (Missing) | 18321 | 4.8% |
FluVaxLast12
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 2705 |
| Missing (%) | 0.7% |
| Memory size | 744.1 KiB |
| True | |
|---|---|
| False | |
| (Missing) | 2705 |
| Value | Count | Frequency (%) |
| True | 198580 | |
| False | 179647 | |
| (Missing) | 2705 | 0.7% |
PneumoVaxEver
Boolean
HIGH CORRELATION  MISSING 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 29798 |
| Missing (%) | 7.8% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | |
| (Missing) |
| Value | Count | Frequency (%) |
| False | 204991 | |
| True | 146143 | |
| (Missing) | 29798 | 7.8% |
TetanusLast10Tdap
Categorical
MISSING 
| Distinct | 4 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 34687 |
| Missing (%) | 9.1% |
| Memory size | 33.9 MiB |
| No, did not receive any tetanus shot in the past 10 years | |
|---|---|
| Yes, received tetanus shot but not sure what type | |
| Yes, received Tdap | |
| Yes, received tetanus shot, but not Tdap |
Length
| Max length | 57 |
|---|---|
| Median length | 49 |
| Mean length | 42.512813 |
| Min length | 18 |
Characters and Unicode
| Total characters | 14719849 |
|---|---|
| Distinct characters | 24 |
| Distinct categories | 5 ? |
| Distinct scripts | 2 ? |
| Distinct blocks | 1 ? |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Yes, received tetanus shot but not sure what type |
|---|---|
| 2nd row | No, did not receive any tetanus shot in the past 10 years |
| 3rd row | No, did not receive any tetanus shot in the past 10 years |
| 4th row | No, did not receive any tetanus shot in the past 10 years |
| 5th row | No, did not receive any tetanus shot in the past 10 years |
Common Values
| Value | Count | Frequency (%) |
| No, did not receive any tetanus shot in the past 10 years | 116598 | |
| Yes, received tetanus shot but not sure what type | 108419 | |
| Yes, received Tdap | 94904 | |
| Yes, received tetanus shot, but not Tdap | 26324 | 6.9% |
| (Missing) | 34687 | 9.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| not | 251341 | 8.8% |
| tetanus | 251341 | 8.8% |
| shot | 251341 | 8.8% |
| received | 229647 | 8.1% |
| yes | 229647 | 8.1% |
| but | 134743 | 4.7% |
| tdap | 121228 | 4.3% |
| 10 | 116598 | 4.1% |
| years | 116598 | 4.1% |
| no | 116598 | 4.1% |
| Other values (9) | 1024845 |
Most occurring characters
| Value | Count | Frequency (%) |
| 2497682 | ||
| e | 1969757 | |
| t | 1590141 | |
| s | 1073944 | 7.3% |
| a | 830782 | 5.6% |
| n | 735878 | 5.0% |
| o | 619280 | 4.2% |
| d | 584071 | 4.0% |
| i | 579441 | 3.9% |
| r | 571262 | 3.9% |
| Other values (14) | 3667611 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 11148929 | |
| Space Separator | 2497682 | 17.0% |
| Uppercase Letter | 467473 | 3.2% |
| Other Punctuation | 372569 | 2.5% |
| Decimal Number | 233196 | 1.6% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 1969757 | |
| t | 1590141 | |
| s | 1073944 | |
| a | 830782 | 7.5% |
| n | 735878 | 6.6% |
| o | 619280 | 5.6% |
| d | 584071 | 5.2% |
| i | 579441 | 5.2% |
| r | 571262 | 5.1% |
| u | 494503 | 4.4% |
| Other values (7) | 2099870 |
Uppercase Letter
| Value | Count | Frequency (%) |
| Y | 229647 | |
| T | 121228 | |
| N | 116598 |
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 116598 | |
| 1 | 116598 |
Space Separator
| Value | Count | Frequency (%) |
| 2497682 |
Other Punctuation
| Value | Count | Frequency (%) |
| , | 372569 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 11616402 | |
| Common | 3103447 | 21.1% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 1969757 | |
| t | 1590141 | |
| s | 1073944 | |
| a | 830782 | 7.2% |
| n | 735878 | 6.3% |
| o | 619280 | 5.3% |
| d | 584071 | 5.0% |
| i | 579441 | 5.0% |
| r | 571262 | 4.9% |
| u | 494503 | 4.3% |
| Other values (10) | 2567343 |
Common
| Value | Count | Frequency (%) |
| 2497682 | ||
| , | 372569 | 12.0% |
| 0 | 116598 | 3.8% |
| 1 | 116598 | 3.8% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 14719849 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 2497682 | ||
| e | 1969757 | |
| t | 1590141 | |
| s | 1073944 | 7.3% |
| a | 830782 | 5.6% |
| n | 735878 | 5.0% |
| o | 619280 | 4.2% |
| d | 584071 | 4.0% |
| i | 579441 | 3.9% |
| r | 571262 | 3.9% |
| Other values (14) | 3667611 |
HighRiskLastYear
Boolean
IMBALANCE 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 1476 |
| Missing (%) | 0.4% |
| Memory size | 744.1 KiB |
| False | |
|---|---|
| True | 16499 |
| (Missing) | 1476 |
| Value | Count | Frequency (%) |
| False | 362957 | |
| True | 16499 | 4.3% |
| (Missing) | 1476 | 0.4% |
CovidPos
Boolean
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 372.1 KiB |
| False | |
|---|---|
| True |
| Value | Count | Frequency (%) |
| False | 270055 | |
| True | 110877 |
| PhysicalHealthDays | MentalHealthDays | SleepHours | HeightInMeters | WeightInKilograms | BMI | Sex | GeneralHealth | LastCheckupTime | PhysicalActivities | RemovedTeeth | HadHeartAttack | HadAngina | HadStroke | HadAsthma | HadSkinCancer | HadCOPD | HadDepressiveDisorder | HadKidneyDisease | HadArthritis | HadDiabetes | DeafOrHardOfHearing | BlindOrVisionDifficulty | DifficultyConcentrating | DifficultyWalking | DifficultyDressingBathing | DifficultyErrands | SmokerStatus | ECigaretteUsage | ChestScan | RaceEthnicityCategory | AgeCategory | AlcoholDrinkers | HIVTesting | FluVaxLast12 | PneumoVaxEver | TetanusLast10Tdap | HighRiskLastYear | CovidPos | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PhysicalHealthDays | 1.000 | 0.312 | -0.084 | -0.056 | 0.059 | 0.099 | 0.064 | 0.312 | 0.041 | 0.244 | 0.118 | 0.142 | 0.154 | 0.136 | 0.135 | 0.035 | 0.224 | 0.219 | 0.143 | 0.248 | 0.094 | 0.110 | 0.156 | 0.250 | 0.439 | 0.336 | 0.344 | 0.075 | 0.031 | 0.195 | 0.021 | 0.041 | 0.132 | 0.066 | 0.024 | 0.106 | 0.027 | 0.030 | 0.075 |
| MentalHealthDays | 0.312 | 1.000 | -0.152 | -0.063 | 0.007 | 0.039 | 0.100 | 0.153 | 0.037 | 0.117 | 0.052 | 0.045 | 0.038 | 0.048 | 0.130 | 0.049 | 0.104 | 0.443 | 0.039 | 0.074 | 0.030 | 0.045 | 0.108 | 0.384 | 0.161 | 0.168 | 0.252 | 0.082 | 0.104 | 0.066 | 0.029 | 0.084 | 0.058 | 0.135 | 0.063 | 0.049 | 0.030 | 0.125 | 0.063 |
| SleepHours | -0.084 | -0.152 | 1.000 | -0.012 | -0.066 | -0.068 | 0.027 | 0.106 | 0.035 | 0.123 | 0.069 | 0.068 | 0.051 | 0.072 | 0.074 | 0.044 | 0.097 | 0.123 | 0.059 | 0.079 | 0.042 | 0.065 | 0.105 | 0.167 | 0.164 | 0.140 | 0.164 | 0.063 | 0.048 | 0.094 | 0.051 | 0.057 | 0.087 | 0.090 | 0.069 | 0.071 | 0.026 | 0.053 | 0.054 |
| HeightInMeters | -0.056 | -0.063 | -0.012 | 1.000 | 0.503 | 0.011 | 0.666 | 0.036 | 0.051 | 0.089 | 0.040 | 0.034 | 0.027 | 0.025 | 0.059 | 0.012 | 0.048 | 0.083 | 0.032 | 0.095 | 0.040 | 0.029 | 0.050 | 0.045 | 0.087 | 0.031 | 0.071 | 0.032 | 0.032 | 0.026 | 0.067 | 0.044 | 0.122 | 0.026 | 0.059 | 0.082 | 0.045 | 0.048 | 0.017 |
| WeightInKilograms | 0.059 | 0.007 | -0.066 | 0.503 | 1.000 | 0.846 | 0.360 | 0.095 | 0.012 | 0.096 | 0.034 | 0.038 | 0.042 | 0.014 | 0.062 | 0.037 | 0.053 | 0.062 | 0.029 | 0.067 | 0.095 | 0.026 | 0.026 | 0.046 | 0.123 | 0.088 | 0.076 | 0.040 | 0.016 | 0.073 | 0.051 | 0.065 | 0.050 | 0.047 | 0.028 | 0.030 | 0.032 | 0.013 | 0.067 |
| BMI | 0.099 | 0.039 | -0.068 | 0.011 | 0.846 | 1.000 | 0.110 | 0.124 | 0.032 | 0.155 | 0.051 | 0.029 | 0.040 | 0.017 | 0.108 | 0.046 | 0.071 | 0.118 | 0.049 | 0.120 | 0.117 | 0.021 | 0.037 | 0.078 | 0.183 | 0.110 | 0.105 | 0.033 | 0.027 | 0.065 | 0.046 | 0.064 | 0.078 | 0.044 | 0.024 | 0.014 | 0.015 | 0.023 | 0.069 |
| Sex | 0.064 | 0.100 | 0.027 | 0.666 | 0.360 | 0.110 | 1.000 | 0.031 | 0.107 | 0.063 | 0.015 | 0.072 | 0.058 | 0.000 | 0.077 | 0.003 | 0.032 | 0.135 | 0.014 | 0.102 | 0.089 | 0.068 | 0.022 | 0.037 | 0.072 | 0.010 | 0.070 | 0.077 | 0.062 | 0.052 | 0.040 | 0.074 | 0.105 | 0.005 | 0.069 | 0.068 | 0.107 | 0.052 | 0.016 |
| GeneralHealth | 0.312 | 0.153 | 0.106 | 0.036 | 0.095 | 0.124 | 0.031 | 1.000 | 0.052 | 0.295 | 0.173 | 0.203 | 0.217 | 0.178 | 0.140 | 0.037 | 0.276 | 0.219 | 0.191 | 0.271 | 0.162 | 0.146 | 0.199 | 0.280 | 0.457 | 0.340 | 0.367 | 0.103 | 0.034 | 0.249 | 0.058 | 0.077 | 0.191 | 0.048 | 0.051 | 0.144 | 0.052 | 0.006 | 0.011 |
| LastCheckupTime | 0.041 | 0.037 | 0.035 | 0.051 | 0.012 | 0.032 | 0.107 | 0.052 | 1.000 | 0.038 | 0.054 | 0.071 | 0.084 | 0.060 | 0.025 | 0.079 | 0.064 | 0.030 | 0.068 | 0.169 | 0.086 | 0.059 | 0.020 | 0.013 | 0.104 | 0.040 | 0.026 | 0.060 | 0.064 | 0.156 | 0.046 | 0.153 | 0.059 | 0.020 | 0.226 | 0.207 | 0.057 | 0.058 | 0.023 |
| PhysicalActivities | 0.244 | 0.117 | 0.123 | 0.089 | 0.096 | 0.155 | 0.063 | 0.295 | 0.038 | 1.000 | 0.199 | 0.086 | 0.079 | 0.082 | 0.046 | 0.005 | 0.140 | 0.081 | 0.086 | 0.128 | 0.147 | 0.076 | 0.095 | 0.109 | 0.286 | 0.172 | 0.193 | 0.118 | 0.024 | 0.102 | 0.072 | 0.127 | 0.161 | 0.023 | 0.025 | 0.053 | 0.106 | 0.021 | 0.014 |
| RemovedTeeth | 0.118 | 0.052 | 0.069 | 0.040 | 0.034 | 0.051 | 0.015 | 0.173 | 0.054 | 0.199 | 1.000 | 0.176 | 0.161 | 0.139 | 0.044 | 0.057 | 0.260 | 0.071 | 0.113 | 0.252 | 0.118 | 0.154 | 0.136 | 0.114 | 0.288 | 0.145 | 0.161 | 0.166 | 0.027 | 0.224 | 0.048 | 0.221 | 0.187 | 0.026 | 0.035 | 0.175 | 0.074 | 0.048 | 0.063 |
| HadHeartAttack | 0.142 | 0.045 | 0.068 | 0.034 | 0.038 | 0.029 | 0.072 | 0.203 | 0.071 | 0.086 | 0.176 | 1.000 | 0.443 | 0.185 | 0.026 | 0.053 | 0.141 | 0.028 | 0.115 | 0.124 | 0.151 | 0.104 | 0.080 | 0.054 | 0.165 | 0.087 | 0.094 | 0.102 | 0.023 | 0.174 | 0.033 | 0.187 | 0.074 | 0.015 | 0.048 | 0.120 | 0.044 | 0.022 | 0.021 |
| HadAngina | 0.154 | 0.038 | 0.051 | 0.027 | 0.042 | 0.040 | 0.058 | 0.217 | 0.084 | 0.079 | 0.161 | 0.443 | 1.000 | 0.152 | 0.036 | 0.082 | 0.159 | 0.033 | 0.149 | 0.152 | 0.158 | 0.114 | 0.074 | 0.049 | 0.176 | 0.093 | 0.094 | 0.091 | 0.033 | 0.189 | 0.050 | 0.215 | 0.066 | 0.025 | 0.080 | 0.158 | 0.032 | 0.028 | 0.016 |
| HadStroke | 0.136 | 0.048 | 0.072 | 0.025 | 0.014 | 0.017 | 0.000 | 0.178 | 0.060 | 0.082 | 0.139 | 0.185 | 0.152 | 1.000 | 0.038 | 0.043 | 0.111 | 0.048 | 0.092 | 0.107 | 0.113 | 0.083 | 0.098 | 0.088 | 0.172 | 0.112 | 0.130 | 0.067 | 0.017 | 0.141 | 0.035 | 0.147 | 0.070 | 0.004 | 0.037 | 0.091 | 0.034 | 0.015 | 0.021 |
| HadAsthma | 0.135 | 0.130 | 0.074 | 0.059 | 0.062 | 0.108 | 0.077 | 0.140 | 0.025 | 0.046 | 0.044 | 0.026 | 0.036 | 0.038 | 1.000 | 0.000 | 0.205 | 0.153 | 0.037 | 0.097 | 0.056 | 0.028 | 0.049 | 0.111 | 0.108 | 0.075 | 0.096 | 0.035 | 0.047 | 0.086 | 0.037 | 0.057 | 0.028 | 0.076 | 0.019 | 0.089 | 0.044 | 0.031 | 0.046 |
| HadSkinCancer | 0.035 | 0.049 | 0.044 | 0.012 | 0.037 | 0.046 | 0.003 | 0.037 | 0.079 | 0.005 | 0.057 | 0.053 | 0.082 | 0.043 | 0.000 | 1.000 | 0.047 | 0.013 | 0.062 | 0.127 | 0.032 | 0.084 | 0.010 | 0.019 | 0.050 | 0.012 | 0.006 | 0.070 | 0.066 | 0.095 | 0.145 | 0.260 | 0.008 | 0.064 | 0.115 | 0.168 | 0.024 | 0.042 | 0.033 |
| HadCOPD | 0.224 | 0.104 | 0.097 | 0.048 | 0.053 | 0.071 | 0.032 | 0.276 | 0.064 | 0.140 | 0.260 | 0.141 | 0.159 | 0.111 | 0.205 | 0.047 | 1.000 | 0.126 | 0.096 | 0.184 | 0.111 | 0.110 | 0.103 | 0.122 | 0.246 | 0.151 | 0.163 | 0.233 | 0.061 | 0.206 | 0.057 | 0.159 | 0.086 | 0.030 | 0.045 | 0.164 | 0.042 | 0.010 | 0.012 |
| HadDepressiveDisorder | 0.219 | 0.443 | 0.123 | 0.083 | 0.062 | 0.118 | 0.135 | 0.219 | 0.030 | 0.081 | 0.071 | 0.028 | 0.033 | 0.048 | 0.153 | 0.013 | 0.126 | 1.000 | 0.052 | 0.121 | 0.056 | 0.038 | 0.087 | 0.340 | 0.152 | 0.133 | 0.209 | 0.122 | 0.140 | 0.072 | 0.063 | 0.119 | 0.028 | 0.141 | 0.017 | 0.037 | 0.056 | 0.089 | 0.042 |
| HadKidneyDisease | 0.143 | 0.039 | 0.059 | 0.032 | 0.029 | 0.049 | 0.014 | 0.191 | 0.068 | 0.086 | 0.113 | 0.115 | 0.149 | 0.092 | 0.037 | 0.062 | 0.096 | 0.052 | 1.000 | 0.132 | 0.169 | 0.078 | 0.074 | 0.053 | 0.161 | 0.088 | 0.100 | 0.044 | 0.024 | 0.128 | 0.024 | 0.145 | 0.082 | 0.002 | 0.067 | 0.131 | 0.015 | 0.019 | 0.008 |
| HadArthritis | 0.248 | 0.074 | 0.079 | 0.095 | 0.067 | 0.120 | 0.102 | 0.271 | 0.169 | 0.128 | 0.252 | 0.124 | 0.152 | 0.107 | 0.097 | 0.127 | 0.184 | 0.121 | 0.132 | 1.000 | 0.174 | 0.152 | 0.097 | 0.102 | 0.333 | 0.153 | 0.152 | 0.137 | 0.064 | 0.231 | 0.120 | 0.400 | 0.096 | 0.027 | 0.148 | 0.272 | 0.054 | 0.065 | 0.036 |
| HadDiabetes | 0.094 | 0.030 | 0.042 | 0.040 | 0.095 | 0.117 | 0.089 | 0.162 | 0.086 | 0.147 | 0.118 | 0.151 | 0.158 | 0.113 | 0.056 | 0.032 | 0.111 | 0.056 | 0.169 | 0.174 | 1.000 | 0.093 | 0.099 | 0.067 | 0.225 | 0.106 | 0.110 | 0.040 | 0.029 | 0.157 | 0.042 | 0.137 | 0.153 | 0.033 | 0.101 | 0.188 | 0.031 | 0.041 | 0.016 |
| DeafOrHardOfHearing | 0.110 | 0.045 | 0.065 | 0.029 | 0.026 | 0.021 | 0.068 | 0.146 | 0.059 | 0.076 | 0.154 | 0.104 | 0.114 | 0.083 | 0.028 | 0.084 | 0.110 | 0.038 | 0.078 | 0.152 | 0.093 | 1.000 | 0.134 | 0.110 | 0.175 | 0.099 | 0.100 | 0.084 | 0.020 | 0.126 | 0.058 | 0.249 | 0.052 | 0.035 | 0.057 | 0.132 | 0.051 | 0.020 | 0.029 |
| BlindOrVisionDifficulty | 0.156 | 0.108 | 0.105 | 0.050 | 0.026 | 0.037 | 0.022 | 0.199 | 0.020 | 0.095 | 0.136 | 0.080 | 0.074 | 0.098 | 0.049 | 0.010 | 0.103 | 0.087 | 0.074 | 0.097 | 0.099 | 0.134 | 1.000 | 0.170 | 0.200 | 0.154 | 0.216 | 0.069 | 0.031 | 0.089 | 0.072 | 0.081 | 0.074 | 0.027 | 0.008 | 0.042 | 0.043 | 0.011 | 0.007 |
| DifficultyConcentrating | 0.250 | 0.384 | 0.167 | 0.045 | 0.046 | 0.078 | 0.037 | 0.280 | 0.013 | 0.109 | 0.114 | 0.054 | 0.049 | 0.088 | 0.111 | 0.019 | 0.122 | 0.340 | 0.053 | 0.102 | 0.067 | 0.110 | 0.170 | 1.000 | 0.219 | 0.204 | 0.313 | 0.124 | 0.137 | 0.086 | 0.060 | 0.094 | 0.066 | 0.090 | 0.044 | 0.011 | 0.025 | 0.090 | 0.026 |
| DifficultyWalking | 0.439 | 0.161 | 0.164 | 0.087 | 0.123 | 0.183 | 0.072 | 0.457 | 0.104 | 0.286 | 0.288 | 0.165 | 0.176 | 0.172 | 0.108 | 0.050 | 0.246 | 0.152 | 0.161 | 0.333 | 0.225 | 0.175 | 0.200 | 0.219 | 1.000 | 0.392 | 0.390 | 0.129 | 0.026 | 0.220 | 0.041 | 0.260 | 0.169 | 0.005 | 0.062 | 0.175 | 0.075 | 0.033 | 0.036 |
| DifficultyDressingBathing | 0.336 | 0.168 | 0.140 | 0.031 | 0.088 | 0.110 | 0.010 | 0.340 | 0.040 | 0.172 | 0.145 | 0.087 | 0.093 | 0.112 | 0.075 | 0.012 | 0.151 | 0.133 | 0.088 | 0.153 | 0.106 | 0.099 | 0.154 | 0.204 | 0.392 | 1.000 | 0.421 | 0.088 | 0.027 | 0.116 | 0.039 | 0.084 | 0.085 | 0.043 | 0.004 | 0.062 | 0.033 | 0.003 | 0.014 |
| DifficultyErrands | 0.344 | 0.252 | 0.164 | 0.071 | 0.076 | 0.105 | 0.070 | 0.367 | 0.026 | 0.193 | 0.161 | 0.094 | 0.094 | 0.130 | 0.096 | 0.006 | 0.163 | 0.209 | 0.100 | 0.152 | 0.110 | 0.100 | 0.216 | 0.313 | 0.390 | 0.421 | 1.000 | 0.100 | 0.061 | 0.131 | 0.035 | 0.088 | 0.118 | 0.047 | 0.002 | 0.074 | 0.041 | 0.026 | 0.016 |
| SmokerStatus | 0.075 | 0.082 | 0.063 | 0.032 | 0.040 | 0.033 | 0.077 | 0.103 | 0.060 | 0.118 | 0.166 | 0.102 | 0.091 | 0.067 | 0.035 | 0.070 | 0.233 | 0.122 | 0.044 | 0.137 | 0.040 | 0.084 | 0.069 | 0.124 | 0.129 | 0.088 | 0.100 | 1.000 | 0.178 | 0.155 | 0.072 | 0.142 | 0.046 | 0.122 | 0.121 | 0.121 | 0.045 | 0.074 | 0.047 |
| ECigaretteUsage | 0.031 | 0.104 | 0.048 | 0.032 | 0.016 | 0.027 | 0.062 | 0.034 | 0.064 | 0.024 | 0.027 | 0.023 | 0.033 | 0.017 | 0.047 | 0.066 | 0.061 | 0.140 | 0.024 | 0.064 | 0.029 | 0.020 | 0.031 | 0.137 | 0.026 | 0.027 | 0.061 | 0.178 | 1.000 | 0.025 | 0.042 | 0.171 | 0.071 | 0.123 | 0.130 | 0.090 | 0.006 | 0.175 | 0.054 |
| ChestScan | 0.195 | 0.066 | 0.094 | 0.026 | 0.073 | 0.065 | 0.052 | 0.249 | 0.156 | 0.102 | 0.224 | 0.174 | 0.189 | 0.141 | 0.086 | 0.095 | 0.206 | 0.072 | 0.128 | 0.231 | 0.157 | 0.126 | 0.089 | 0.086 | 0.220 | 0.116 | 0.131 | 0.155 | 0.025 | 1.000 | 0.082 | 0.287 | 0.092 | 0.046 | 0.095 | 0.228 | 0.060 | 0.025 | 0.005 |
| RaceEthnicityCategory | 0.021 | 0.029 | 0.051 | 0.067 | 0.051 | 0.046 | 0.040 | 0.058 | 0.046 | 0.072 | 0.048 | 0.033 | 0.050 | 0.035 | 0.037 | 0.145 | 0.057 | 0.063 | 0.024 | 0.120 | 0.042 | 0.058 | 0.072 | 0.060 | 0.041 | 0.039 | 0.035 | 0.072 | 0.042 | 0.082 | 1.000 | 0.121 | 0.089 | 0.165 | 0.120 | 0.155 | 0.070 | 0.065 | 0.056 |
| AgeCategory | 0.041 | 0.084 | 0.057 | 0.044 | 0.065 | 0.064 | 0.074 | 0.077 | 0.153 | 0.127 | 0.221 | 0.187 | 0.215 | 0.147 | 0.057 | 0.260 | 0.159 | 0.119 | 0.145 | 0.400 | 0.137 | 0.249 | 0.081 | 0.094 | 0.260 | 0.084 | 0.088 | 0.142 | 0.171 | 0.287 | 0.121 | 1.000 | 0.148 | 0.309 | 0.295 | 0.508 | 0.092 | 0.214 | 0.185 |
| AlcoholDrinkers | 0.132 | 0.058 | 0.087 | 0.122 | 0.050 | 0.078 | 0.105 | 0.191 | 0.059 | 0.161 | 0.187 | 0.074 | 0.066 | 0.070 | 0.028 | 0.008 | 0.086 | 0.028 | 0.082 | 0.096 | 0.153 | 0.052 | 0.074 | 0.066 | 0.169 | 0.085 | 0.118 | 0.046 | 0.071 | 0.092 | 0.089 | 0.148 | 1.000 | 0.054 | 0.007 | 0.079 | 0.084 | 0.078 | 0.041 |
| HIVTesting | 0.066 | 0.135 | 0.090 | 0.026 | 0.047 | 0.044 | 0.005 | 0.048 | 0.020 | 0.023 | 0.026 | 0.015 | 0.025 | 0.004 | 0.076 | 0.064 | 0.030 | 0.141 | 0.002 | 0.027 | 0.033 | 0.035 | 0.027 | 0.090 | 0.005 | 0.043 | 0.047 | 0.122 | 0.123 | 0.046 | 0.165 | 0.309 | 0.054 | 1.000 | 0.045 | 0.074 | 0.121 | 0.134 | 0.077 |
| FluVaxLast12 | 0.024 | 0.063 | 0.069 | 0.059 | 0.028 | 0.024 | 0.069 | 0.051 | 0.226 | 0.025 | 0.035 | 0.048 | 0.080 | 0.037 | 0.019 | 0.115 | 0.045 | 0.017 | 0.067 | 0.148 | 0.101 | 0.057 | 0.008 | 0.044 | 0.062 | 0.004 | 0.002 | 0.121 | 0.130 | 0.095 | 0.120 | 0.295 | 0.007 | 0.045 | 1.000 | 0.344 | 0.135 | 0.065 | 0.068 |
| PneumoVaxEver | 0.106 | 0.049 | 0.071 | 0.082 | 0.030 | 0.014 | 0.068 | 0.144 | 0.207 | 0.053 | 0.175 | 0.120 | 0.158 | 0.091 | 0.089 | 0.168 | 0.164 | 0.037 | 0.131 | 0.272 | 0.188 | 0.132 | 0.042 | 0.011 | 0.175 | 0.062 | 0.074 | 0.121 | 0.090 | 0.228 | 0.155 | 0.508 | 0.079 | 0.074 | 0.344 | 1.000 | 0.126 | 0.063 | 0.077 |
| TetanusLast10Tdap | 0.027 | 0.030 | 0.026 | 0.045 | 0.032 | 0.015 | 0.107 | 0.052 | 0.057 | 0.106 | 0.074 | 0.044 | 0.032 | 0.034 | 0.044 | 0.024 | 0.042 | 0.056 | 0.015 | 0.054 | 0.031 | 0.051 | 0.043 | 0.025 | 0.075 | 0.033 | 0.041 | 0.045 | 0.006 | 0.060 | 0.070 | 0.092 | 0.084 | 0.121 | 0.135 | 0.126 | 1.000 | 0.015 | 0.052 |
| HighRiskLastYear | 0.030 | 0.125 | 0.053 | 0.048 | 0.013 | 0.023 | 0.052 | 0.006 | 0.058 | 0.021 | 0.048 | 0.022 | 0.028 | 0.015 | 0.031 | 0.042 | 0.010 | 0.089 | 0.019 | 0.065 | 0.041 | 0.020 | 0.011 | 0.090 | 0.033 | 0.003 | 0.026 | 0.074 | 0.175 | 0.025 | 0.065 | 0.214 | 0.078 | 0.134 | 0.065 | 0.063 | 0.015 | 1.000 | 0.053 |
| CovidPos | 0.075 | 0.063 | 0.054 | 0.017 | 0.067 | 0.069 | 0.016 | 0.011 | 0.023 | 0.014 | 0.063 | 0.021 | 0.016 | 0.021 | 0.046 | 0.033 | 0.012 | 0.042 | 0.008 | 0.036 | 0.016 | 0.029 | 0.007 | 0.026 | 0.036 | 0.014 | 0.016 | 0.047 | 0.054 | 0.005 | 0.056 | 0.185 | 0.041 | 0.077 | 0.068 | 0.077 | 0.052 | 0.053 | 1.000 |
| State | Sex | GeneralHealth | PhysicalHealthDays | MentalHealthDays | LastCheckupTime | PhysicalActivities | SleepHours | RemovedTeeth | HadHeartAttack | HadAngina | HadStroke | HadAsthma | HadSkinCancer | HadCOPD | HadDepressiveDisorder | HadKidneyDisease | HadArthritis | HadDiabetes | DeafOrHardOfHearing | BlindOrVisionDifficulty | DifficultyConcentrating | DifficultyWalking | DifficultyDressingBathing | DifficultyErrands | SmokerStatus | ECigaretteUsage | ChestScan | RaceEthnicityCategory | AgeCategory | HeightInMeters | WeightInKilograms | BMI | AlcoholDrinkers | HIVTesting | FluVaxLast12 | PneumoVaxEver | TetanusLast10Tdap | HighRiskLastYear | CovidPos | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Alabama | Female | Very good | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | No | 8.0 | NaN | No | No | No | No | No | No | No | No | No | Yes | No | No | No | No | No | No | Never smoked | Not at all (right now) | No | White only, Non-Hispanic | Age 80 or older | NaN | NaN | NaN | No | No | Yes | No | Yes, received tetanus shot but not sure what type | No | No |
| 1 | Alabama | Female | Excellent | 0.0 | 0.0 | NaN | No | 6.0 | NaN | No | No | No | No | Yes | No | No | No | No | No | No | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | No | White only, Non-Hispanic | Age 80 or older | 1.60 | 68.04 | 26.57 | No | No | No | No | No, did not receive any tetanus shot in the past 10 years | No | No |
| 2 | Alabama | Female | Excellent | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 7.0 | NaN | No | No | No | Yes | No | No | No | No | Yes | No | No | No | No | No | No | No | Current smoker - now smokes some days | Never used e-cigarettes in my entire life | Yes | White only, Non-Hispanic | NaN | 1.65 | 63.50 | 23.30 | No | No | Yes | Yes | No, did not receive any tetanus shot in the past 10 years | No | No |
| 3 | Alabama | Female | Fair | 2.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 9.0 | NaN | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | Yes | White only, Non-Hispanic | Age 40 to 44 | 1.57 | 53.98 | 21.77 | Yes | No | No | Yes | No, did not receive any tetanus shot in the past 10 years | No | No |
| 4 | Alabama | Male | Poor | 1.0 | 0.0 | Within past year (anytime less than 12 months ago) | No | 7.0 | NaN | Yes | No | Yes | No | No | No | No | No | No | Yes | No | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | No | White only, Non-Hispanic | Age 80 or older | 1.80 | 84.82 | 26.08 | No | No | No | Yes | No, did not receive any tetanus shot in the past 10 years | No | No |
| 5 | Alabama | Female | Very good | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 7.0 | NaN | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Former smoker | Never used e-cigarettes in my entire life | No | Black only, Non-Hispanic | Age 80 or older | 1.65 | 62.60 | 22.96 | Yes | No | No | No | No, did not receive any tetanus shot in the past 10 years | No | No |
| 6 | Alabama | Female | Good | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | No | 8.0 | NaN | No | No | No | No | No | No | No | No | Yes | No | No | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | Yes | White only, Non-Hispanic | Age 80 or older | 1.63 | 73.48 | 27.81 | No | No | Yes | Yes | Yes, received tetanus shot but not sure what type | No | No |
| 7 | Alabama | Female | Good | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 6.0 | NaN | No | No | No | No | Yes | No | No | No | Yes | No | No | Yes | No | Yes | No | No | Former smoker | Not at all (right now) | NaN | White only, Non-Hispanic | Age 75 to 79 | 1.70 | NaN | NaN | No | Yes | No | No | Yes, received tetanus shot but not sure what type | No | No |
| 8 | Alabama | Female | Good | 1.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 7.0 | NaN | No | No | No | No | No | No | No | Yes | No | Yes | No | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | NaN | White only, Non-Hispanic | Age 70 to 74 | 1.68 | 81.65 | 29.05 | Yes | NaN | Yes | Yes | No, did not receive any tetanus shot in the past 10 years | No | No |
| 9 | Alabama | Female | Fair | 8.0 | 9.0 | Within past year (anytime less than 12 months ago) | No | 8.0 | NaN | No | No | No | No | No | No | No | No | No | No | NaN | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | Yes | White only, Non-Hispanic | Age 80 or older | 1.60 | 74.84 | 29.23 | No | No | Yes | Yes | Yes, received tetanus shot but not sure what type | No | No |
| State | Sex | GeneralHealth | PhysicalHealthDays | MentalHealthDays | LastCheckupTime | PhysicalActivities | SleepHours | RemovedTeeth | HadHeartAttack | HadAngina | HadStroke | HadAsthma | HadSkinCancer | HadCOPD | HadDepressiveDisorder | HadKidneyDisease | HadArthritis | HadDiabetes | DeafOrHardOfHearing | BlindOrVisionDifficulty | DifficultyConcentrating | DifficultyWalking | DifficultyDressingBathing | DifficultyErrands | SmokerStatus | ECigaretteUsage | ChestScan | RaceEthnicityCategory | AgeCategory | HeightInMeters | WeightInKilograms | BMI | AlcoholDrinkers | HIVTesting | FluVaxLast12 | PneumoVaxEver | TetanusLast10Tdap | HighRiskLastYear | CovidPos | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 380922 | Virgin Islands | Male | Fair | 10.0 | 0.0 | Within past 2 years (1 year but less than 2 years ago) | Yes | NaN | 1 to 5 | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Current smoker - now smokes some days | Never used e-cigarettes in my entire life | No | Black only, Non-Hispanic | Age 50 to 54 | 1.80 | 90.72 | 27.89 | Yes | Yes | No | No | No, did not receive any tetanus shot in the past 10 years | No | Yes |
| 380923 | Virgin Islands | Male | Fair | NaN | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 6.0 | 1 to 5 | No | No | No | No | No | Yes | Yes | No | Yes | No | No | No | No | No | No | No | Former smoker | Not at all (right now) | Yes | Black only, Non-Hispanic | Age 35 to 39 | 1.85 | 104.33 | 30.34 | No | Yes | No | NaN | Yes, received tetanus shot but not sure what type | No | Yes |
| 380924 | Virgin Islands | Female | Fair | 0.0 | 10.0 | Within past year (anytime less than 12 months ago) | Yes | 6.0 | 1 to 5 | No | No | No | No | No | No | No | No | Yes | No | No | No | No | Yes | No | Yes | Never smoked | Never used e-cigarettes in my entire life | NaN | Black only, Non-Hispanic | Age 80 or older | 1.65 | 88.45 | 32.45 | Yes | NaN | No | No | NaN | No | Yes |
| 380925 | Virgin Islands | Male | Good | 14.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | NaN | None of them | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Former smoker | Never used e-cigarettes in my entire life | Yes | Hispanic | Age 30 to 34 | 1.83 | 95.25 | 28.48 | No | Yes | No | No | No, did not receive any tetanus shot in the past 10 years | No | Yes |
| 380926 | Virgin Islands | Male | Fair | 30.0 | 1.0 | Within past year (anytime less than 12 months ago) | No | 6.0 | 6 or more, but not all | No | NaN | Yes | No | No | Yes | No | No | No | No, pre-diabetes or borderline diabetes | No | No | No | No | No | No | Former smoker | Never used e-cigarettes in my entire life | Yes | White only, Non-Hispanic | Age 70 to 74 | 1.78 | 70.31 | 22.24 | No | No | Yes | NaN | Yes, received tetanus shot but not sure what type | No | Yes |
| 380927 | Virgin Islands | Female | Fair | 0.0 | 7.0 | Within past year (anytime less than 12 months ago) | Yes | 7.0 | None of them | No | No | No | No | No | No | Yes | No | No | No | No | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | No | Black only, Non-Hispanic | Age 25 to 29 | 1.93 | 90.72 | 24.34 | No | No | No | No | No, did not receive any tetanus shot in the past 10 years | No | Yes |
| 380928 | Virgin Islands | Male | Good | 0.0 | 15.0 | Within past year (anytime less than 12 months ago) | Yes | 7.0 | 1 to 5 | No | No | Yes | No | No | No | No | No | Yes | Yes | No | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | No | Multiracial, Non-Hispanic | Age 65 to 69 | 1.68 | 83.91 | 29.86 | Yes | Yes | Yes | Yes | Yes, received tetanus shot but not sure what type | No | Yes |
| 380929 | Virgin Islands | Male | Good | 0.0 | 0.0 | Within past 2 years (1 year but less than 2 years ago) | Yes | 8.0 | None of them | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | No | White only, Non-Hispanic | Age 30 to 34 | 1.83 | 104.33 | 31.19 | Yes | NaN | No | No | NaN | No | Yes |
| 380930 | Virgin Islands | Female | Good | 0.0 | 3.0 | Within past 2 years (1 year but less than 2 years ago) | Yes | 6.0 | None of them | No | No | No | Yes | No | No | Yes | No | No | No | No | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | Yes | Black only, Non-Hispanic | Age 18 to 24 | 1.65 | 69.85 | 25.63 | NaN | Yes | No | No | No, did not receive any tetanus shot in the past 10 years | No | Yes |
| 380931 | Virgin Islands | Male | Very good | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | No | 5.0 | None of them | Yes | No | No | Yes | No | No | No | No | No | No | No | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | Yes | Black only, Non-Hispanic | Age 70 to 74 | 1.83 | 108.86 | 32.55 | No | Yes | Yes | Yes | No, did not receive any tetanus shot in the past 10 years | No | Yes |
Most frequently occurring
| State | Sex | GeneralHealth | PhysicalHealthDays | MentalHealthDays | LastCheckupTime | PhysicalActivities | SleepHours | RemovedTeeth | HadHeartAttack | HadAngina | HadStroke | HadAsthma | HadSkinCancer | HadCOPD | HadDepressiveDisorder | HadKidneyDisease | HadArthritis | HadDiabetes | DeafOrHardOfHearing | BlindOrVisionDifficulty | DifficultyConcentrating | DifficultyWalking | DifficultyDressingBathing | DifficultyErrands | SmokerStatus | ECigaretteUsage | ChestScan | RaceEthnicityCategory | AgeCategory | HeightInMeters | WeightInKilograms | BMI | AlcoholDrinkers | HIVTesting | FluVaxLast12 | PneumoVaxEver | TetanusLast10Tdap | HighRiskLastYear | CovidPos | # duplicates | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Arizona | Female | Excellent | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 7.0 | None of them | No | No | No | No | Yes | No | No | No | Yes | No | No | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | No | White only, Non-Hispanic | Age 75 to 79 | 1.63 | 56.70 | 21.46 | Yes | No | Yes | Yes | Yes, received Tdap | No | No | 2 |
| 1 | Maryland | Female | Good | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 8.0 | None of them | No | No | No | Yes | No | No | No | No | No | No | No | No | No | No | No | No | Former smoker | Never used e-cigarettes in my entire life | Yes | White only, Non-Hispanic | Age 65 to 69 | 1.65 | 45.36 | 16.64 | Yes | No | Yes | Yes | Yes, received Tdap | No | No | 2 |
| 2 | Maryland | Male | Excellent | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 8.0 | None of them | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | No | White only, Non-Hispanic | Age 50 to 54 | 1.75 | 65.77 | 21.41 | Yes | No | Yes | Yes | Yes, received Tdap | No | No | 2 |
| 3 | Montana | Male | Good | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 5.0 | None of them | No | No | No | No | No | No | No | No | No | Yes | Yes | No | No | No | No | No | Former smoker | Not at all (right now) | No | NaN | Age 50 to 54 | 1.75 | 97.52 | 31.75 | Yes | Yes | Yes | No | Yes, received tetanus shot but not sure what type | No | No | 2 |
| 4 | New Jersey | Female | Excellent | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 7.0 | None of them | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | No | Former smoker | Never used e-cigarettes in my entire life | No | White only, Non-Hispanic | Age 50 to 54 | NaN | NaN | NaN | Yes | No | Yes | No | Yes, received tetanus shot but not sure what type | No | Yes | 2 |
| 5 | New Jersey | Male | Good | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | No | 8.0 | 6 or more, but not all | No | No | Yes | No | No | No | No | No | No | Yes | No | No | No | No | No | No | Former smoker | Never used e-cigarettes in my entire life | Yes | White only, Non-Hispanic | Age 75 to 79 | 1.63 | 80.74 | 30.55 | Yes | No | No | No | No, did not receive any tetanus shot in the past 10 years | No | No | 2 |
| 6 | Rhode Island | Female | Very good | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 7.0 | 1 to 5 | No | No | No | Yes | No | No | No | No | No | No | No | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | No | White only, Non-Hispanic | Age 75 to 79 | 1.57 | 68.04 | 27.44 | Yes | No | Yes | Yes | No, did not receive any tetanus shot in the past 10 years | No | No | 2 |
| 7 | South Dakota | Female | Fair | 30.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 7.0 | None of them | No | No | No | No | No | No | No | No | No | Yes | No | No | No | Yes | No | No | Never smoked | Never used e-cigarettes in my entire life | Yes | White only, Non-Hispanic | Age 70 to 74 | 1.75 | 90.72 | 29.53 | No | No | Yes | Yes | No, did not receive any tetanus shot in the past 10 years | No | No | 2 |
| 8 | Vermont | Female | Very good | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 9.0 | None of them | No | No | No | No | No | No | No | No | Yes | No | No | No | No | Yes | No | No | Former smoker | Never used e-cigarettes in my entire life | Yes | White only, Non-Hispanic | Age 70 to 74 | 1.65 | 79.38 | 29.12 | Yes | No | Yes | Yes | Yes, received tetanus shot but not sure what type | No | No | 2 |
| 9 | Washington | Male | Excellent | 0.0 | 0.0 | Within past year (anytime less than 12 months ago) | Yes | 7.0 | None of them | No | No | No | No | Yes | No | No | No | No | No | No | No | No | No | No | No | Never smoked | Never used e-cigarettes in my entire life | No | White only, Non-Hispanic | Age 60 to 64 | 1.80 | 77.11 | 23.71 | Yes | Yes | Yes | No | Yes, received Tdap | No | No | 2 |